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The arrow of time dilemma: the laws of physics are invariant for time inversion, whereas the 
familiar phenomena we see everyday are not (i.e. entropy increases). I show that, within a quantum 
mechanical framework, all phenomena which leave a trail of information behind (and hence can 
be studied by physics) are those where entropy necessarily increases or remains constant. All 
phenomena where the entropy decreases must not leave any information of their having happened. 
This situation is completely indistinguishable from their not having happened at all. In the light of 
this observation, the second law of thermodynamics is reduced to a mere tautology: physics cannot 
study those processes where entropy has decreased, even if they were commonplace. 



PACS numbers: 03.67.-a,03.65.Ud 



Paradoxes have always been very fruitful in stimulating 
advances in physics. One which still lacks a satisfactory 
explanation is the Loschmidt paradox Namely, how 
can we obtain irreversible phenomena from reversible 
time-symmetric physical laws [||? The irreversibility in 
Physics is summarized by the second law of thermody- 
namics: entropy, which measures the degradation of the 
usable energy in a system, never decreases in isolated 
systems. Many approaches have been proposed to solve 
this conundrum, but most ultimately resort to postulat- 
ing low entropy initial states (see e.g. ||, f|), which is 
clearly an ad hoc assumption || . Others suggest that the 
thermodynamic arrow of time is in some way connected 
to the cosmological one ||, that physical laws must be 
modified to embed irreversibility J7J, that irreversibility 
arises from decoherence Q , or from some time-symmetric 
mechanism embedded in quantum mechanics U , etc. Re- 
cent reviews on this problem are given in Ref . JlO| . 

Here I propose a different approach, based on exist- 
ing laws of physics (quantum mechanics). I show that 
entropy in a system can both increase and decrease (as 
time reversal dictates), but that all entropy-decreasing 
transformations cannot leave any trace of their having 
happened. Since no information on them exists, this is 
indistinguishable from the situation in which such trans- 
formations do not happen at all: "The past exists only 
insofar as it is recorded in the present" Then the 

second law is forcefully valid: the only physical evolu- 
tions we see in our past, and which can then be studied, 
are those where entropy has not decreased. 

I start by briefly relating the thermodynamic entropy 
with the von Neumann entropy, and introducing the sec- 
ond law. I then present two thought experiments, where 
entropy is deleted together with all records of the entropy 
increasing processes: even though at some time the en- 
tropy of the system had definitely increased, afterward 
it is decreased again, but none of the observers can be 
aware of it. I conclude with a general derivation through 
the analysis of the entropy transfers that take place in 
physical transformations. 

Entropy and the second law. Thermodynamic en- 
tropy is a quantity that measures how the usable en- 
ergy in a physical process is degraded into heat. It 



can be introduced in many ways from different axiom- 
atizations of thermodynamics. The von Neumann en- 
tropy of a quantum system in the state p is defined 
as S(p) = — Tr[plog 2 p]. When applicable, these two 
entropies coincide (except for an inconsequential mul- 
tiplicative factor). This derives from an argument in- 
troduced by Einstein |L2[ and extended by Peres 
(e.g. both the canonical and the microcanonical ensem- 
ble can be derived from quantum mechanical consider- 
ations |l4|, jf^])- For °ur purposes, however, it is suffi- 
cient to observe that thermodynamic and von Neumann 
entropies can be inter-converted, emp loy ing Maxwell- 
demons (r], |l^] or Szilard-engines Jl8|, Q: useful work 
can be extracted from a single thermal reservoir by in- 
creasing the von Neumann entropy of a memory space. 

There are many different formulations of the second 
law, but we can summarize them by stating that, in 
any process in which an isolated system goes from one 
state to another, its thermodynamic entropy does not 
decrease [ pp| . There is a hidden assumption in this state- 
ment. Whenever an isolated system is obtained by join- 
ing two previously isolated systems, then the second law 
is valid only if the two systems are initially uncorrelated, 
i.e. if their initial joint entropy is the sum of their individ- 
ual entropies. It is generally impossible to exclude that 
two systems might be correlated in some unknown way 
and there is no operative method to determine whether 
a system is uncorrelated from all others (e.g. given a box 
containing some gas it is impossible to exclude that the 
gas particles might be correlated with other systems). 
Thus, in thermodynamics all systems are considered un- 
correlated, unless it is known otherwise. Without this 
assumption, it would be impossible to assign an entropy 
to any system unless the state of the whole universe 
is known: a normal observer is limited in the informa- 
tion she can acquire and on the control she can apply. 
This implies that thermodynamic entropy is a subjective 
quantity pl[ , even though for all practical situations this 
is completely irrelevant: the eventual correlations in all 
macroscopic systems are practically impossible to con- 
trol and exploit. Even though they are ignored by the 
normal observer, correlations between herself and other 
systems do exist. Until they are eliminated, the other sys- 
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terns cannot decrease their entropy. A physical process 
may either reduce or increase these correlations. When 
they are reduced, this may seem to entail a diminish- 
ing of the entropy, but the observer will not be aware of 
it as her memories are correlations and will have been 
erased by necessity (each bit of memory is one bit of 
correlation and, until her memory has been erased, the 
correlations are not eliminated). Instead, when the phys- 
ical process increases these correlations, she will see it as 
an increase in entropy. The observer will then only be 
aware of entropy non-decreasing processes. [Not even a 
super-observer that can keep track of all the correlations 
would ever see any entropy decrease. In fact, since he can 
discover and take advantage of all correlations between 
microscopic degrees of freedom, all processes are always 
zero-entropy processes from his super-observer point of 
view.] 

The above analysis is limited to systems that are some- 
how correlated with the observer. One might then expect 
that she could witness entropy decreasing processes in 
systems that are completely factorized from her. That is 
indeed the case: statistical microscopic fluctuations can 
occasionally decrease the entropy of a system (the second 
law has only a statistical valence). However, an observer 
is macroscopic by definition, and all remotely interact- 
ing macroscopic systems become correlated very rapidly 
(e.g. Borel famously calculated that moving a gram of 
material on the star Sirius by one meter can influence 
the trajectories of the particles in a gas on earth on a 
time-scale of ps |2^] ) . This is the same mechanism at the 
basis of quantum decoherence J8|, and it entails that in 
practice the above analysis applies to all situations: no 
entropy decrease in macroscopic systems is ever observed. 

In what follows I will make these ideas rigorous. 

Since the two above definitions of entropy are equiv- 
alent, the von Neumann entropy also obeys the second 
law. In fact, isolated systems evolve with unitary evolu- 
tions, which leave the von Neumann entropy invariant. 
There may be an increase if the evolution is not exactly 
known or if it creates unknown correlations among sub- 
systems. In the first case, the coarse-grained evolution is 
of the form p' = J2 n PnU n pU n \ where p n is a probabil- 
ity and U n are unitary operators. Then the final entropy 
S(p') may be larger than the initial entropy S(p): 

S(p) = SiJ^PnUnpUj) ^ J2P nS ( Un P Un ^ = S (PW 
n n 

(the inequality follows from the concavity of the entropy) . 
In the second case, the entropy of subsystems can in- 
crease, as some unknown correlations between them may 
build up: 

S(A) + S(p' 2 ) > S(U( Pl ® p 2 )tf) = S( Pl ) + S{ P2 ) , (2) 

where p\ and pi are the final and initial states of the 
subsystems, U is the evolution coupling them, and the 
last equality holds if they are initially uncorrelated (the 
inequality follows from the subadditivity of the entropy) . 



Thought experiments. The quantum information the- 
ory mantra "Information is physical" |2^] implies that 
any record [M of an occurred event can be decorrelated 
from such event by an appropriate physical interaction. 
If all the records of an event are decorrelated from it, 
then by definition there is no way to know whether this 
event has ever happened. This situation is indistinguish- 
able from its not having happened. If this event has in- 
creased the entropy, the subsequent erasing of all records 
can (will) produce an entropy decrease without violation 
of any physical law. We now analyze two such situations, 
an imperfect transmission of energy and a quantum mea- 
surement. 

Alice's lab is perfectly isolated, so that to an outside 
observer (Bob), its quantum evolution is unitary. Ana- 
lyze the situation in which Bob sends Alice some energy 
in the form of light, a multimode electromagnetic field 
in a zero-entropy pure state. We suppose that, to se- 
cure the energy Bob is sending her, she uses many de- 
tectors which are not matched to his modes. Given a 
system in almost any possible pure state, all its subsys- 
tems which are small enough are approximately in the 
canonical state |Q. This implies that, if each of Alice's 
detectors is sensitive to only a small part of Bob's modes, 
the detectors mostly see thermal radiation, and she feels 
them warming up. She will then be justified in assigning 
a nonzero thermodynamic entropy to her detectors, as 
she sees them basically as thermal-equilibrium systems. 
One might object that she is mistaken, since the states 
of the detectors are not uncorrelated. However, since she 
ignores the correlations, she cannot use such correlations 
to extract energy from the detectors. Alice concludes 
that most of the energy Bob sent her has been wasted 
as heat, raising the thermodynamic entropy of her lab. 
Suppose now that Bob has complete control of all the 
degrees of freedom in her lab. He knows and can exploit 
the correlations to recover all the energy he had initially 
given Alice. Of course, although possible in principle, 
he needs a dauntingly complex transformation, which re- 
quires him to be able to control a huge number of her 
lab's degrees of freedom (including the brain cells where 
her memories are, and the notepads where she wrote the 
temperatures!). To extract the energy, since it was ini- 
tially locked in a pure state of the field, he must return 
it to a system in a zero-entropy pure state, i.e. factor- 
ized from all the other degrees of freedom of Alice's lab. 
Then he must erase all the correlations between them: at 
the end of Bob's recovery, Alice cannot remember feeling 
her detectors warm up, they are cool again, her notepads 
contain no temperature information, and all the energy 
initially in the electromagnetic field is again available, 
even though (from Alice's point of view) most of it was 
definitely locked into thermal energy at one time. 

The second though-experiment (25) is a prototypical 
quantum measurement. Bob prepares a spin- 1/2 parti- 
cle oriented along the x axis, e.g. in a spin | — >) state 
and hands it to Alice. She sends it through a Stern- 
Gerlach apparatus oriented along the z axis p3[. The 
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FIG. 1: a) Alice in her isolated lab performs a Stern-Gerlach 
measurement on a spin 1/2 particle initially oriented paral- 
lel to the x axis, i.e. in a state | — »). Since the apparatus is 
oriented along the z axis, this measurement creates one bit 
of entropy for Alice (not for Bob, who is isolated from her 
lab), b) Bob flips a switch that "cancels" Alice measurement 
by decorrelating from the spin all those degrees of freedom of 
her lab that have recorded the measurement outcome. Now 
the spin is returned to its initial state | — >) and Alice can- 
not have any memory of what her measurement result was. 
Her entropy has decreased, but she cannot remember it ever 
having increased. 

measurement consists in coupling the quantum system 
with some macroscopic degrees of freedom (a reservoir), 
not all of which are under the control of the exper- 
imenter ]2ql , whence the irreversibility. Notice that, 
I -*•) = (I T} + | l))/V2, where | |) and | [) are the eigen- 
states of a z measurement operator. Hence, this appara- 
tus will increase the entropy of the spin system by one 
bit Jig] : Before the readout, the spin state will be in the 
maximally mixed state (| t)(T l + l I) (I DA After Alice 
has looked at the result, she has transferred this one bit 
of entropy, created by the measurement, to her memory. 
From the point of view of Bob, outside her isolated lab, 
Alice's measurement is simply a (quantum) correlation 
of her measurement apparatus to the spin. [A thorough 
analysis of the microscopic details and of the thermody- 
namics of this type of measurement is given in Ref . [jl6| .] 
The initial state of the spin | ->) = (| T ) + I I ))/v / 2 
evolves into the correlated (entangled) state 

(| t ) | Alice sees "up") + | J, )| Alice sees "down"))/%/2 ,(3) 

where the first ket in the two terms refers to the spin 
state, whereas the second ket refers to the rest of Al- 
ice's lab. Thus, from the point of view of Bob, Alice's 
measurement is an evolution similar to a controlled-NOT 
unitary transformation of the type U cnot (\0} + |1))|0) = 
|0)|0) + Such a transformation can be easily in- 

verted, as it is its own inverse. Analogously, Bob can 
flip a switch and invert Alice's measurement. At the end 
of his operation, all records of the measurement result 
(Alice's notepad, her brain cells, the apparatus gauges) 
will have been decorrelated from the spin state. She will 
remember having performed the measurement, but she 



will be (must be) unable to recall what the measurement 
result was. In addition, the spin has become uncorre- 
lated from the measurement apparatus, so it is returned 
to a pure state. I emphasize that Bob's transformation 
is not necessarily a reversion of the dynamics of Alice's 
lab. [Notice that a "quantum eraser" |^7j only permits 
to decide a posteriori which of two complementary mea- 
surements to perform using previously collected data: the 
measurement process is not actually erased, and the en- 
tropy does not decrease.] 

In both the above experiments, from Alice's point of 
view, entropy definitely has been created after she has 
interacted with Bob's light or his spin. However, this 
entropy is subsequently coherently erased by Bob. At the 
end of the process, looking back at the evolution in her 
lab, she cannot see any violation of the second law: she 
has no (cannot have any) record of the fact that entropy 
at one point had increased. 

Entropic considerations. The above thought exper- 
iments exemplify a general situation: entropy can de- 
crease, but its decrease is accompanied by an erasure of 
any memory that the entropy-decreasing transformation 
has occurred. In fact, any interaction between an ob- 
server A and a system C which decreases their entropy by 
a certain quantity, must also reduce their quantum mu- 
tual information by the same amount (unless, of course, 
the entropy is dumped into a reservoir R) . The quantum 
mutual information S(A : C) = S(pa) + S(pc) — S(pac) 
measures the amount of shared quantum correlations be- 
tween the two systems A and C (pAC being the state of 
the system AC, and pA and pc its partial traces, i.e. the 
states of A and C). 

Taking the cue from (2£| , I now prove the above asser- 
tion, namely I show that 

AS(A) + AS(C) - AS(R) - AS(A : C) = , (4) 

where AS'(A") = S t (px) — So(px) is the entropy differ- 
ence between the final state at time t and the initial state 
of the system X, and where AS (A : C) = S t (A : C) - 
Sq(A : C) is the quantum mutual information difference. 
Choose the reservoir R so that the joint state of the sys- 
tems ACR is pure and so that the evolution maintains the 
purity (i? is a purification space) . Then the initial and fi- 
nal entropies are S (AC) = S (R) and S t (AC) = S t (R), 
respectively. Thus we find S (AB) = S t (AB) - AS(R) 
which, when substituted into the left-hand-side term of 
(Q), shows that this term is null. [This proof is valid also 
if the evolution is not perfectly known, i.e. if it is given 
by a random unitary map, see Eq. ([l]).] 

Now, to prove that the above reduction of entropy en- 
tails a memory erasure, I show that this erasure must 
follow from the elimination of quantum mutual informa- 
tion. A memory of an event is a physical system A which 
has nonzero classical mutual information on a system C 
that bears the consequences of that event. Then, the 
erasure of the memory follows from an elimination of the 
quantum mutual information S(A : C) if this last quan- 
tity is an upper bound to the classical mutual information 
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I (A : C). Thus, we must show that for any POVM mea- 
surement {n^°' ) eg) 11^ } extracting information separately 
from the two systems (11^ acting on A and LL^ on C), 

S(A : C) ^ I {A : C) , (5) 

where I (A : C) is the mutual information of the POVM's 
measurement results. A simple proof of this statement 
exists (e.g. see |^9[ ||^]): use the equality S(A : C) — 
S(pac\\pa®Pc), where S(p\\a) =Tr[plog 2 p-plog 2 a] is 
the quantum relative entropy. Thi s q uantity is mono- 
tone for application of CP-maps p9(| , i.e. S(p\\cr) ^ 
5(A/'[ / o]||A/'[cr]) for any transformation J\f that can be 
written as Af[p] = J2k AkpA^ , with the Kraus opera- 
tors Ak satisfying ^uAkAk = 1. Consider the "mea- 
sure and reprepare" channel, i.e. the transformation 
N[p] — J2 n Tr[II„p] \n)(n\ where {\n}} is a basis, and U n 
is a POVM element (i.e. a positive operator such that 
52 n n = 1). It is a CP-map, since it has a Kraus form 

A nm = \n)(vW\ V p£\ with H„ = J2p^\ v ^)( v ^\ -(6) 

m 

Using the monotonicity of the relative entropy under the 
action of the map Af, we find 

S(A : C) = S{p AC \\pA ® Pc) > S{N[p AC ] II M[p A ® Pc]) 
= Pij lo S2Py - Fy log 2 (%^') = ^(^4 : C) , 

where Pij =Tr[Ilj a) ® p AC ], q t =Tr[n| a) P A], and 
r,=Tr[n< c V c ]. 

The interpretation of Eq. (|4|) is that, if we want to de- 
crease the entropy of the system C (somehow correlated 
with the observer A) without increasing the entropy of a 
reservoir R, we need to reduce the quantum mutual infor- 
mation between C and the observer A (e.g. in the Stern- 
Gerlach thought experiment, the system A is Alice's lab 
and C is the spin-1/2 particle: their final entropies are 
reduced by one bit at the expense of erasing two bits of 
quantum mutual information Sq(A : G)). The fact that 
mutual information can be used to decrease entropy was 
already pointed out by Lloyd (2^| and Zurek J3lf . 

The implications of the above analysis can be seen ex- 
plicitly by employing Eq. (Q) twice, by considering an 
intermediate time when S{C) is higher than at the ini- 
tial and final times. The entropy S(C) of the system is 
high at the intermediate time after an entropy-increasing 
transformation, and then (if no entropy-absorbing reser- 
voir R is used) it can be reduced by a successive entropy- 
decreasing transformation at the cost of reducing the 
mutual information between the observer and C . Even 
though the entropy S{C) (as measured from the point 
of view of the observer A) does decrease, the observer is 
not aware of it, as the entropy-decreasing transformation 
must factorize her from the system C containing infor- 
mation on the prior entropy- increasing event: her mem- 
ories of such event must be part of the destroyed corre- 
lations. The deep reason for this is that, from her own 



point of view, the Born rule kicks in when the observer 
becomes entangled with another system. (An external 
super-observer may, instead, just see her becoming en- 
tangled with the other system, but then he cannot know 
the measurement result.) The Born rule is the only place 
where quantum mechanics allows irreversibility, but the 
correlations that stem from such rule can be undone, at 
least if one treats both the observer and the apparatus 
quantum mechanically. This means that the measure- 
ment can be undone, at the price that all the observer's 
memories must be erased. 

What we have seen up to now is that any decrease in 
entropy of a system that is correlated with an observer 
entails a memory erasure of said observer, in the absence 
of reservoirs (or is a zero-entropy process for a super- 
observer that keeps track of all the correlations). That 
might seem to imply that an observer should be able to 
see entropy-decreasing processes when considering sys- 
tems that are uncorrelated from her. In fact, at micro- 
scopic level, statistical fluctuations do decrease occasion- 
ally the entropy. However, the correlations between any 
two macroscopic systems build up continuously, and at 
amazing rates this is how decoherence arises [||. 

Then no observer is really factorized with respect to any 
macroscopic system she observes. This implies that en- 
tropy decreases of a macroscopic system becomes unob- 
servable (unless extreme care is taken to shield the system 
under analysis). Only microscopic systems can be con- 
sidered factorized from an observer for a period of time 
long enough to see entropy decrease from fluctuations. 

Conclusions. In this paper I gave a quantum solu- 
tion to the Loschmidt paradox, showing that all physical 
transformations where entropy is decreased cannot re- 
linquish any memory of their having happened from the 
point of view of any observer: both normal observers that 
interact with the studied systems and external super- 
observers that keep track of all the correlations. Thus 
they are irrelevant to physics. Quantum mechanics is 
necessary to this argument. In the above derivation, we 
have used the property that the entropy of a joint system 
can be smaller than that of each of its subsystems. This 
is true of von Neumann entropy, but not true if entropy 
is calculated using classical probability theory: then the 
entropy of a joint system is always larger than that of 
its subsystem with largest entropy. By how much must 
any system be extended until we can take advantage of 
this quantum reduction of the global entropy? It is clear 
from Borel's famous arguments that the time scale in 
which a macroscopic system can be really considered as 
isolated is very small. As such, the arguments presented 
in this paper are of theoretical interest only, and have 
little or no practical consequence for any normal macro- 
scopic system: the effects presented here become relevant 
only at a scale that approaches the whole universe very 
rapidly. 

In closing, I indulge in a couple of more philosophi- 
cal considerations. In a quantum cosmological setting, 
the above approach easily fits in the hypothesis that the 
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quantum state of the whole universe is a pure (i.e. zero- 
entropy) state evolving unitarily (e.g. see |32| §)■ 
One of the most puzzling aspects of our universe is the 
fact that its initial state had entropy so much lower than 
we see today, making the initial state highly unlikely . 
Joining the above hypothesis of a zero-entropy pure state 
of the universe with the second law considerations ana- 
lyzed in this paper, it is clear that such puzzle can be 
resolved. The universe may be in a zero entropy state, 
even though it appears (to us, internal observers) to pos- 
sess a higher entropy: our situation is similar to the one 
of Alice, who, just after the measurement sees her lab in a 
nonzero entropy state, whereas to the super-observer Bob 
her lab maintains a zero-entropy state all along. How- 
ever, it is clear that this approach does not require to 
deal with the quantum state of the whole universe, but 
it applies also to arbitrary physical systems. 

In a quantum cosmological framework, Boltzmann's 
initial condition translates in the equivalent question of 
why the initial state of the universe is such that its sub- 
systems are mostly unentangled. Such a state is highly 
improbable, as all states in a sufficiently large Hilbert 
space are almost completely entangled |Q. The most 
compelling answer to this question derives from Davies' 
argument that, as space expands because of cosmologi- 
cal expansion and inflation, new degrees of freedom are 
created, giving the potential for accommodating new en- 
tropy [p5| : one may think that, as these new degrees of 
freedom are created, they are initialized in a factorized 
(possibly pure) state. However, this assumption is un- 



warranted, and moreover, our current understanding of 
quantum mechanics does not allow the description of a 
situation where the number of degrees of freedom of a 
system (and hence its Hilbert space) changes dynami- 
cally 1 36 . The alternative solution presented in this pa- 



per sidesteps the problem: whatever the state of the uni- 
verse, an internal observer would still only see the pro- 
cesses where entropy increases. 

In addition, I recall that there is a substantial problem 
in rigorously defining past and future without resorting 
to the second law (which would then be reduced to a 
mere definition). In fact, the laws of physics are time- 
reversal invariant. Hence, there is no preferred direction 
of time according to which we may establish a substantial 
difference between the two temporal directions past-to- 
future and future-to-past p7| . Anthropocentrically, we 
could define the past as that of which we have memories 
of, and the future as that of which we do not have any 
memories. Of course, such definition cannot be made rig- 
orous, since it resorts to a observers and their memories. 
However, even using this ambiguous, intuitive definition 
of past, it is clear that any event, which cannot have any 
correlation with us, does not pertain to our past just as 
if it had never happened. 

I thank S. Lloyd, G.M. D'Ariano, V. Giovannctti, G.P. 
Beretta, S. Wolf, and A. Winter for useful discussions and 
feedback. 
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